Using constituency and dependency parse features to identify errorful words in disordered language
نویسندگان
چکیده
Delayed or disordered language is a characteristic of both autism spectrum disorder (ASD) and specific language impairment (SLI). In this paper, we describe our data set, which consists of transcribed data from a widely used clinical diagnostic instrument (the ADOS) for children with ASD and children with SLI. These transcripts are manually annotated with SALT, an annotation system that applies a descriptive code to errorful words. Here we address a step in automating SALT annotation: identifying the errorful words in sentences that are known to contain an error. We propose a set of baseline features to identify errorful words, and investigate the effectiveness of adding features extracted from dependency and constituency parses. We find that features from both types of parses improve classifier performance above our baseline, both individually and in aggregate.
منابع مشابه
تبدیل خودکار درختبانک وابستگی فارسی به درختبانک سازهای
There are two major types of treebanks: dependency-based and constituency-based. Both of them have applications in natural language processing and computational linguistics. Several dependency treebanks have been developed for Persian. However, there is no available big size constituency treebank for this language. In this paper, we aim to propose an algorithm for automatic conversion of a depe...
متن کاملA Comparison of Alternative Parse Tree Paths for Labeling Semantic Roles
The integration of sophisticated inference-based techniques into natural language processing applications first requires a reliable method of encoding the predicate-argument structure of the propositional content of text. Recent statistical approaches to automated predicateargument annotation have utilized parse tree paths as predictive features, which encode the path between a verb predicate a...
متن کاملTree Representations for Chinese Semantic Role Labeling
We compare different parse tree representations for the task of Chinese Semantic Role Labeling (SRL), including dependency and constituency parse trees, two tree pruning methods, and neighbor features. Three learning models are compared. By using SVM classifier with neighbor features and pruning tree to phrase level we achieve significantly better speed and accuracy than state of the art Chines...
متن کاملConvolution Kernels for Subjectivity Detection
In this paper, we explore different linguistic structures encoded as convolution kernels for the detection of subjective expressions. The advantage of convolution kernels is that complex structures can be directly provided to a classifier without deriving explicit features. The feature design for the detection of subjective expressions is fairly difficult and there currently exists no commonly ...
متن کاملSemantic Role Labeling Using Dependency Trees
In this paper, a novel semantic role labeler based on dependency trees is developed. This is accomplished by formulating the semantic role labeling as a classification problem of dependency relations into one of several semantic roles. A dependency tree is created from a constituency parse of an input sentence. The dependency tree is then linearized into a sequence of dependency relations. A nu...
متن کامل